Material to A note on oligonucleotide expression values not being normally distributed

Authors

  • JOHANNA HARDIN
  • JASON WILSON
Abstract

Motivation: Novel techniques for analyzing microarray data are constantly being developed. Though many of the methods contribute to biological discoveries, the inability to properly evaluate novel techniques limits their ability to advance science. Because the underlying structure, or distribution, of microarray data is unknown, novel methods are typically tested against the assumed structure of normally distributed data. However, microarray data are not, in fact, normally distributed, and testing against normally distributed data can have misleading consequences.

Results: Using an Affymetrix technical replicate Spike-In data set, we showed that oligonucleotide expression values are not universally normally distributed under any of the standard methods for extracting expression values. The resulting data tend to include a large proportion of skewed and heavy-tailed values. Additionally, using data simulated under three models (normal, heavy-tailed, and skewed), we showed that standard methodologies (for differential expression and gene similarity) can give unexpected and misleading results when the data are not normally distributed. Robust methods should be used when analyzing microarray data, and skewed and/or heavy-tailed data distributions should be considered in simulations when evaluating new techniques.
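As a rough illustration of the three simulation models named in the abstract, the sketch below draws samples from a normal, a heavy-tailed, and a skewed distribution and compares their sample skewness and excess kurtosis. The specific choices here (Student's t with 3 degrees of freedom for the heavy-tailed case, a unit-rate exponential for the skewed case, the sample size, the seed, and all function names) are assumptions made for illustration only; the abstract does not say which distributions or parameters the authors used.

```python
import random
import statistics


def sample_skewness(values):
    """Biased sample skewness: mean of the cubed standardized values."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)


def sample_excess_kurtosis(values):
    """Biased sample excess kurtosis: mean fourth standardized moment minus 3."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 4 for v in values) / len(values) - 3.0


def simulate(model, n, rng):
    """Draw n values from one of three illustrative models (assumed, not the paper's)."""
    if model == "normal":
        return [rng.gauss(0.0, 1.0) for _ in range(n)]
    if model == "heavy_tailed":
        # Student's t with df degrees of freedom: Z / sqrt(chi2(df) / df),
        # where chi2(df) is a Gamma(shape=df/2, scale=2) variate.
        df = 3
        return [
            rng.gauss(0.0, 1.0) / (rng.gammavariate(df / 2.0, 2.0) / df) ** 0.5
            for _ in range(n)
        ]
    if model == "skewed":
        # Unit-rate exponential: right-skewed, theoretical skewness of 2.
        return [rng.expovariate(1.0) for _ in range(n)]
    raise ValueError(f"unknown model: {model}")


def summarize(n=5000, seed=0):
    """Return {model: (skewness, excess_kurtosis)} for the three models."""
    rng = random.Random(seed)
    summary = {}
    for model in ("normal", "heavy_tailed", "skewed"):
        values = simulate(model, n, rng)
        summary[model] = (sample_skewness(values), sample_excess_kurtosis(values))
    return summary


if __name__ == "__main__":
    for model, (skew, kurt) in summarize().items():
        print(f"{model:>12}: skewness={skew:7.3f}  excess kurtosis={kurt:8.3f}")
```

A new method evaluated only on the first model would never be exposed to the asymmetry or extreme values of the other two, which is the gap the abstract warns about.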

Similar resources

A note on oligonucleotide expression values not being normally distributed.

Novel techniques for analyzing microarray data are constantly being developed. Though many of the methods contribute to biological discoveries, inability to properly evaluate the novel techniques limits their ability to advance science. Because the underlying distribution of microarray data is unknown, novel methods are typically tested against the assumed normal distribution. However, microarr...

Oligonucleotide microarray data are not normally distributed

ABSTRACT Motivation: Novel techniques for analyzing microarray data are constantly being developed. Though many of the methods contribute to biological discoveries, inability to properly evaluate the novel techniques limits their ability to advance the science. Because the underlying structure, or distribution, of microarray data is unknown, novel methods are usually tested against the known st...

Effect of Alccofine on Mechanical and Durability Index Properties of Green Concrete (TECHNICAL NOTE)

In the modern era, many research works are being carried out throughout the world for finding out a suitable cementitious material for the replacement of cement. The supplementary cementitious materials (SCM) can be used as a replacement of cement in the construction industry to minimize the carbon dioxide emission which is implicated in global warming and climatic changes in the environment. T...

Comparison of background correction and normalization procedures for high-density oligonucleotide microarrays

Oligonucleotide microarrays are now becoming a widely used research tool in gene expression analysis. A large variety of preprocessing methods for raw intensity measures is available to establish per-gene expression values. For their evaluation, a small number of spike-in and dilution data sets has been published. However, calibration data sets with varying parameters such as percentage of diff...

Classification of oligonucleotide fingerprints: application for microbial community and gene expression analyses

MOTIVATION Oligonucleotide fingerprinting of ribosomal RNA genes (OFRG) is a procedure that sorts rRNA gene (rDNA) clones into taxonomic groups through a series of hybridization experiments. The hybridization signals are classified into three discrete values 0, 1 and N, where 0 and 1, respectively, specify negative and positive hybridization events and N designates an uncertain assignment. This...

Publication date: 2009